Competing Bandits: Learning Under Competition
نویسندگان
چکیده
Most modern systems strive to learn from interactions with users, and many engage in exploration: making potentially suboptimal choices for the sake of acquiring new information. We initiate a study of the interplay between exploration and competition—how such systems balance the exploration for learning and the competition for users. Here the users play three distinct roles: they are customers that generate revenue, they are sources of data for learning, and they are self-interested agents which choose among the competing systems. In our model, we consider competition between two multi-armed bandit algorithms faced with the same bandit instance. Users arrive one by one and choose among the two algorithms, so that each algorithm makes progress if and only if it is chosen. We ask whether and to what extent competition incentivizes the adoption of better bandit algorithms. We investigate this issue for several models of user response, as we vary the degree of rationality and competitiveness in the model. Our findings are closely related to the “competition vs. innovation” relationship, a well-studied theme in economics. 1998 ACM Subject Classification F.1.1 Models of Computation
منابع مشابه
Exploiting Competition Relationship for Robust Visual Recognition
Joint learning of similar tasks has been a popular trend in visual recognition and proven to be beneficial. Between-task similarity often provides useful cues, such as feature sharing, for learning visual classifiers. By contrast, the competition relationship between visual recognition tasks (e.g., content independent writer identification and handwriting recognition) remains largely under-expl...
متن کاملMarket power influential approach using game theory in a two competing supply chains with multi-echelons under centralized/decentralized environments
This paper is considering the competition between two multi-echelon supply-chains on price and service under balance and imbalance of market power between the chains which are analyzing through Nash and Stackelberg game approach. The problem is categorized as the centralized or decentralized structure of each chain, which means a few different possible scenarios are developing based on the Nash...
متن کاملReal-Time Competition Processes in Word Learning
Perceptual processes take time to unfold. Whether a person is processing a visual scene, identifying the category an object belongs to, or recognizing a word, cognitive processes involving competition across time occur. These ongoing competitive processes have been ignored in studies of learning. However, some forms of learning suggest that learning could occur while competition is ongoing, res...
متن کاملSequential Monte Carlo Bandits
In this paper we propose a flexible and efficient framework for handling multi-armed bandits, combining sequential Monte Carlo algorithms with hierarchical Bayesian modeling techniques. The framework naturally encompasses restless bandits, contextual bandits, and other bandit variants under a single inferential model. Despite the model’s generality, we propose efficient Monte Carlo algorithms t...
متن کاملOutsourcing through Three-dimensional Competition
In this paper, we study an outsourced supply chain consisting of one buyer and two suppliers in which the buyer outsources manufacturing of a physical product to two competing suppliers. The suppliers compete for the buyers' demands share, and the buyer allocates the demands to the competing suppliers based on three-dimensional allocation functions. We consider two certain types of allocation f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2018